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(54) PERSONAL FACE RECOGNIZING DEVICE AND ITS METHOD 

(57)Abstract: 

PROBLEM TO BE SOLVED: To make it possible to execute 
quick and accurate recognition processing for the image of a 
personal face. 

SOLUTION: The image recognition device is provided with 
an fetching part 1 for fetching a target image, a reliability 
calculation part for comparing dictionary data 7 to 9 related 
to previously stored luminance frequency with the 
luminance values of plural unit areas of the target image and 
obtaining the reliability of plural unit areas of the target 
image, and a collation part 6 for comparing a face collation 
dictionary 12 previously stored for recognition with the 
target image in every unit area based on the calculated 
reliability and collating the target image. 
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CLAIMS 

[Claim(s)] 

[Claim 1] Image recognition equipment characterized by providing the following. An image input 
means to input an image for recognition in image recognition equipment which recognizes an image 
for recognition A dictionary creation means to draw up a dictionary for collating from an image for 
recognition inputted with this input means A brightness frequency dictionary held in quest of 
brightness frequency of each pixel in an image for recognition based on two or more dictionary data 
created with this creation means ** which has a collating means which collates by considering 
reliability about each pixel of an input image~based on brightness of each pixel of said brightness 
frequency dictionary and input image in the case of this collating while collating an input image and 
a dictionary drawn up with said creation means at the time of recognition of an image for recognition 

[Claim 2] It is image recognition equipment characterized by making it not use said collating means 
for collating in image recognition equipment according to claim 1 about a pixel which seemingly will 
not be reasonable as an image for recognition based on said brightness frequency dictionary. 
[Claim 3] It is image recognition equipment characterized by said collating means making a 
contribution to a collating result low in image recognition equipment according to claim 1 about a 
pixel which seemingly will not be reasonable as an image for recognition based on said brightness 
frequency dictionary. 

[Claim 4] It is image-recognition equipment which said brightness frequency dictionary has a 
brightness frequency dictionary for a face field extract held in quest of brightness frequency of a 
logging image at the time of dictionary data origination by said creation means in image-recognition 
equipment according to claim 1, and is characterized by for said extract means to have a face field 
extract means which cuts down an image for recognition based on said brightness frequency 
dictionary. 

[Claim 5] In image recognition equipment according to claim 4 said dictionary creation means A 
parts dictionary about a parts image of a specific region in an image for recognition is drawn up. Said 
said brightness frequency dictionary has a brightness frequency dictionary for a kind parts extract 
held in quest of brightness frequency of a parts image of parts dictionary creation time by said 
creation means. Said extract means is image recognition equipment characterized by having a kind 
parts extract means which cuts down an image for recognition to a kind parts image extracted with 
said face field extract means based on said brightness frequency dictionary. 

[Claim 6] In image recognition equipment according to claim 5, said dictionary creation means draws 
up a face collating dictionary about an image for recognition of a specific region in an image for 
recognition. Said said brightness frequency dictionary has a brightness frequency dictionary for face 
collating held in quest of brightness frequency of an image for recognition of a face collating 
dictionary by said creation means. Said extract means has a collating positioning means which 
chooses an image for recognition of a specific region in an input image from relation with a kind 
parts image extracted with a kind parts extract means as an image for collating. Said collating 
means is image recognition equipment characterized by being based based on said brightness 
frequency dictionary for face collating, and collating using a pixel which will seemingly be reasonable 
as an image for recognition when collating a face collating dictionary and an image for collating. 
[Claim 7] In an image recognition method of recognizing an image for recognition, a dictionary for 



collating is drawn up from an inputted image for recognition. While holding in quest of brightness 
frequency of each pixel in an image for recognition based on two or more of these created 
dictionary data and collating an input image and said dictionary at the time of recognition of an 
image for recognition An image recognition method characterized by collating by considering 
reliability about each **** of an input image based on brightness of each pixel of said brightness 
frequency dictionary and input image on the occasion of this collating. 

[Claim 8] It is image recognition equipment characterized by making it not use for collating in an 
image recognition method according to claim 7 about a pixel which seemingly will not be reasonable 
as an image for recognition based on said brightness frequency dictionary when collating said input 
image and said dictionary. 

[Claim 9] It is image recognition equipment characterized by making a contribution to a collating 
result low in an image recognition method according to claim 7 about a pixel which seemingly will not 
be reasonable as an image for recognition based on said brightness frequency dictionary when 
collating said input image and said dictionary. 

[Claim 10] In an image recognition method of recognizing an image for recognition, an image field for 
recognition is started from an inputted image. A face collating dictionary for collating is drawn up 
from an image for recognition of a specific region in an image of a started image field for recognition. 
In quest of brightness frequency of each pixel in a ********(ed) image field for recognition, a 
brightness frequency dictionary for a face field extract is drawn up. At the time of recognition of an 
image for recognition, a field which will seemingly be reasonable as an image for recognition is 
extracted as an image for recognition based on said brightness frequency among images in an input 
image. An image recognition method characterized by choosing an image for collating from an image 
for recognition of a specific region in this extracted image for recognition, and making it collate with 
said face collating dictionary. 

[Claim 1 1] In case said face collating dictionary is drawn up, while drawing up a parts dictionary 
about a parts image of a specific region in an image for recognition in image recognition equipment 
according to claim 10 In drawing up a brightness frequency dictionary for a parts extract in quest of 
brightness frequency of a parts image of parts dictionary creation time and performing collating with 
said face collating dictionary and input image Based on a brightness frequency dictionary for a parts 
extract, and a parts dictionary, a parts image of an input image to a specific region is extracted from 
an image for recognition extracted based on said brightness frequency dictionary for a face field 
extract. An image recognition method characterized by choosing physical relationship of an 
extracted parts image to an image for collating from an input image, and making it collate with said 
face collating dictionary. 

[Claim 12] The image-recognition method characterized by to collate based on said brightness 
frequency dictionary for face collating using the pixel which will seemingly be reasonable as an 
image for recognition when ****** the brightness frequency dictionary for face collating held in an 
image-recognition method according to claim 10 in quest of brightness frequency of an image for 
recognition of a face collating dictionary in case said face collating dictionary is drawn up and 
collating said face collating dictionary and the image for collating. 

[Claim 13] Image recognition equipment which is characterized by providing the following and which 
recognizes an image for recognition An image input means to input an image for recognition A focus 
extract means to extract the focus from a sample image, inputted with this input means A dictionary 
creation means to form a partial dictionary pattern showing characteristic quantity of a partial image 
near [ which was extracted with this focus extract means / each ] the focus, and to draw up a 
partial dictionary of each focus A collating means to perform collating with characteristic quantity 
which extracted each focus from an input image inputted with said input means with said focus 
extract means at the time of recognition of an image for recognition, and was obtained from a partial 
image near [ each ] the focus, and a partial dictionary pattern of said partial dictionary 
[Claim 14] A geometry change of a point with a unique brightness [ set to image recognition 
equipment of claim 13, and / means / said / focus extract ] change in a small field in an image for 
recognition or brightness is image recognition equipment characterized by extracting a unique point 
as the focus. 

[Claim 15] Image recognition equipment characterized by holding only a partial dictionary pattern 



which evaluated redundancy, about two or more partial dictionary patterns obtained from a sample 
image, and was made into non-redundancy in image recognition equipment of claim 13 at the time of 
dictionary construction by said dictionary creation means in a partial dictionary. 
[Claim 16] It is image recognition equipment characterized by complementing characteristic quantity 
in image recognition equipment of claim 13 using an approximation function which does not hold two 
or more partial dictionary pattern **** obtained from a sample image , but holds a partial dictionary 
pattern in which characteristic quantity is shown for every predetermined gap , and is 
complemented about the section in the meantime at the time of dictionary construction by said 
dictionary creation means . 

[Claim 17] It is image recognition equipment which creates and holds a contribution coefficient 
which contributes to recognition according to a use to each partial dictionary pattern in image 
recognition equipment of claim 13, and is characterized by said collating means collating by 
considering a contribution coefficient. 

[Claim 18] Image recognition equipment characterized by to use a set of a partial dictionary pattern 
effective in recognition on the occasion of collating by said collating means, creating and holding a 
contribution coefficient which contributes to recognition according to a use to each partial 
dictionary pattern in image recognition equipment of claim 13, and choosing from distribution of a 
contribution coefficient according to a use of recognition. 

[Claim 19] While having a brightness frequency dictionary held in quest of brightness frequency of 
each pixel in an image for recognition based on dictionary data in image recognition equipment of 
claim 13 and collating an input image and a dictionary drawn up with said creation means at the time 
of recognition of an image Image recognition equipment characterized by having having a collating 
means which collates by considering reliability about each pixel of an input image based on 
brightness of each pixel of said brightness frequency dictionary and input image in the case of this 
collating. 

[Claim 20] In an image recognition method of recognizing an inputted image for recognition, the 
focus is extracted from an inputted sample image. Form a partial dictionary pattern showing 
characteristic quantity of a partial image near [ this / that was extracted / each ] the focus, and a 
partial dictionary of each focus is drawn up. An image recognition method characterized by 
performing collating with characteristic quantity which extracted said focus from an inputted input 
image at the time of recognition of an image for recognition, and was obtained from a partial image 
near [ each ] the focus, and a partial dictionary pattern of said partial dictionary. 
[Claim 21] An image recognition method characterized by setting to said focus extract and 
extracting a point with a unique brightness change in a small field in an image for recognition, or a 
point with a unique geometry change of brightness as the focus in an image recognition method of 
claim 20. 

[Claim 22] An image recognition method characterized by holding only a partial dictionary pattern 
which evaluated redundancy about two or more partial dictionary patterns obtained from a sample 
image by said dictionary creation time, and was made into non-redundancy in an image recognition 
method of claim 20 in a partial dictionary. 

[Claim 23] It is the image-recognition method characterized by complementing characteristic 
quantity using an approximation function which does not hold two or more partial dictionary pattern 
3)eitc*9t( obtained from a sample image by said dictionary creation time, but holds a partial dictionary 
pattern in which characteristic quantity is shown for every predetermined gap, and is complemented 
about the section in the meantime in an image recognition method of claim 20. 
[Claim 24] It is the image recognition method which creates and holds a contribution coefficient 
which contributes to recognition according to a use to each partial dictionary pattern in an image 
recognition method of claim 20, and is characterized by said collating means collating by considering 
a contribution coefficient. 

[Claim 25] An image-recognition method characterized by to use a set of a partial dictionary 
pattern effective in recognition on the occasion of collating by said collating means, creating and 
holding a contribution coefficient which contributes to recognition according to a use to each partial 
dictionary pattern in an image recognition method of claim 20, and choosing from distribution of a 
contribution coefficient according to a use of recognition. 



[Claim 26] The image-recognition method characterized by to have collating by considering 
reliability about each pixel of an input image in the case of this collating based on the brightness of 
each pixel of said brightness frequency dictionary and input image while draw up the brightness 
frequency dictionary which holds in quest of brightness frequency of each pixel in an image for 
recognition based on dictionary data in the image-recognition method of claim 20 and collating an 
input image and said dictionary at the time of recognition of an image. 

[Claim 27] An image recognition method characterized by providing the following A taking-in 
production process which captures an image of an object containing a person's face image A focus 
detection production process of detecting the focus about an object image captured at the 
aforementioned taking-in production process based on the singular point of distribution of 
brightness in a predetermined field of this object image A contribution coefficient production 
process of computing two or more contribution coefficient which shows effectiveness for collating 
about two or more set semi- data of the focus for recognition stored beforehand, respectively, and 
storing it The collating production process which compares the remaining two or more set semi— 
data at the time of redundant criteria data being deleted at said criteria data deletion production 
process of deleting two or more redundant criteria data in consideration of the redundancy of 
criteria data, and said criteria data deletion production process with the focus detected at said 
focus detection production process for two or more of said unit fields of every based on said 
contribution coefficient stored at said contribution coefficient production process, and collates said 
object image _ _ _ . 
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DETAILED DESCRIPTION 

[Detailed Description of the Invention] 
[0001] 

[The technical field to which invention belongs] This invention relates to the person face recognition 
equipment which is image recognition equipment which recognizes the target image pattern, and 
recognizes especially human being's face, and its person face recognition method. 
[0002] 

[Description of the Prior Art] recently, although a user has the needs of little individual collating of a 
burden by non-contact in the field which went as the password of OA equipment, such as individual 
collating of social systems, such as close recession management, an ATM device, and a ticket 
machine, and a personal computer, the image recognition **** method attracts attention as what 
does not use not much special equipment in that case. 

[0003] As for this, what prepares two or more face images of a test subject, considers as a logging 
dictionary, performs pattern matching of an input face image and a dictionary at the time of 
recognition, and performs face location extract and collating is common. As a dictionary, in order to 
make it correspond to various conditions in each people, there are some which prepare the image of 
two or more sheets, but since dictionary size becomes huge if it remains as it is, there is a method 
which draws up the dictionary which carried out the feature compression of these statistically, and 
what is similar to analysis of a principal component as statistical compression is used. 
[0004] 

[Problem(s) to be Solved by the Invention] However, by the method by the above-mentioned 
conventional image recognjtion, there is a problem that exact recognition is difficult, in the 
environment which does not have little lighting fluctuation. Lighting fluctuation has brightness 
change in the whole screen, and the partial change by brightness inclination or the shadow, and the 
former is received. Although there is an example which performed normalization by the image 
magnitude of a vector (norm) about the dictionary and the input image, or performed histogram 
flattening and there is an example which carried out the plane reliance panel of the brightness in a 
screen to the latter There is a problem that a face extract goes wrong even if it carries out with 
them, or collating becomes incorrectness. 

[0005] By being made that the above-mentioned problem should be solved, calculating the reliability 
for every smallness field of the field which matches, and performing matching according to this 
reliability, this invention realizes quick and exact matching and aims at offering the person face 
recognition equipment which performs recognition processing quick [ a person's face ] and exact by 
this, and this person face recognition method. 
[0006] 

[Means for Solving the Problem] An image input means by which this invention inputs an image for 
recognition in image recognition equipment which recognizes an image for recognition, A dictionary 
creation means to draw up a dictionary for collating from an image for recognition inputted with this 
input means, At the time of a brightness frequency dictionary held in quest of brightness frequency 
of each pixel in an image for recognition based on two or more dictionary data created with this 
creation means, and recognition of an image for recognition, while collating an input image and a 
dictionary drawn up with said creation means It is image recognition equipment characterized by 
having having a collating means which collates by considering reliability about each pixel of an input 



image based on brightness of each pixel of said brightness frequency dictionary and input image in 
the case of this collating. 

[0007] Moreover, this invention is image recognition equipment characterized by making it not use 
said collating means for collating about a pixel which seemingly will not be reasonable as an image 
for recognition based on said brightness frequency dictionary in image recognition equipment 
according to claim 1. 

[0008] Moreover, this invention is image recognition equipment characterized by said collating 
means making a contribution to a collating result low about a pixel which seemingly will not be 
reasonable as an image for recognition based on said brightness frequency dictionary in image 
recognition equipment according to claim 1. 

[0009] Moreover, it is image-recognition equipment characterized by for this invention, to have a 
brightness frequency dictionary for a face field extract which holds said brightness frequency 
dictionary in quest of brightness frequency of a logging image at the time of dictionary data 
origination by said creation means in image-recognition equipment according to claim 1, and for said 
extract means to have a face field extract means which cuts down an image for recognition based 
on said brightness frequency dictionary. 

[0010] This invention is set to image recognition equipment according to claim 4. Moreover, said 
dictionary creation means A parts dictionary about a parts image of a specific region in an image for 
recognition is drawn up. Said said brightness frequency dictionary has a brightness frequency 
dictionary for a kind parts extract held in quest of brightness frequency of a parts image of parts 
dictionary creation time by said creation means. Said extract means is image recognition equipment 
characterized by having a kind parts extract means which cuts down an image for recognition to a 
kind parts image extracted with said face field extract means based on said brightness frequency 
dictionary. 

[0011] Moreover, this invention draws up a face collating dictionary concerning [ said dictionary 
creation means ] an image for recognition of a specific region in an image for recognition in image 
recognition equipment according to claim 5. Said said brightness frequency dictionary has a 
brightness frequency dictionary for face collating held in quest of brightness frequency of an image 
for recognition of a face collating dictionary by said creation means. Said extract means has a 
collating positioning means which chooses an image for recognition of a specific region in an input 
image from relation with a kind parts image extracted with a kind parts extract means as an image 
for collating. In case said collating means collates a face collating dictionary and an image for 
collating, it is image recognition equipment characterized by being based based on said brightness 
frequency dictionary for face collating, and collating using a pixel which will seemingly be reasonable 
as an image for recognition. 

[0012] Moreover, this invention draws up a dictionary for collating from an inputted image for 
recognition in an image recognition method of recognizing an image for recognition. While holding in 
quest of brightness frequency of each pixel in an image for recognition based on two or more of 
these created dictionary data and collating an input image and said dictionary at the time of 
recognition of an image for recognition It is the image recognition method characterized by collating 
by considering reliability about each of an input image based on brightness of each pixel of 
said brightness frequency dictionary and input image on the occasion of this collating. 
[0013] Moreover, in an image recognition method according to claim 7, in case this invention 
collates said input image and said dictionary, it is image recognition equipment characterized by 
making it not use for collating about a pixel which seemingly will not be reasonable as an image for 
recognition based on said brightness frequency dictionary. 

[0014] Moreover, in an image recognition method according to claim 7, in case this invention 
collates said input image and said dictionary, it is image recognition equipment characterized by 
making a contribution to a collating result low about a pixel which seemingly will not be reasonable 
as an image for recognition based on said brightness frequency dictionary. 

[001 5] Moreover, this invention starts an image field for recognition from an inputted image in an 
image recognition method of recognizing an image for recognition. A face collating dictionary for 
collating is drawn up from an image for recognition of a specific region in an image of a started 
image field for recognition. In quest of brightness frequency of each pixel in a *****3|c**(ed) image 



field for recognition, a brightness frequency dictionary for a face field extract is drawn up. At the 
time of recognition of an image for recognition, a field which will seemingly be reasonable as an 
image for recognition is extracted as an image for recognition based on said brightness frequency 
among images in an input image. It is the image recognition method characterized by choosing an 
image for collating from an image for recognition of a specific region in this extracted image for 
recognition, and making it collate with said face collating dictionary. 

[0016] Moreover, in case this invention draws up said face collating dictionary, while it draws up a 
parts dictionary about a parts image of a specific region in an image for recognition in image 
recognition equipment according to claim 10 In drawing up a brightness frequency dictionary for a 
parts extract in quest of brightness frequency of a parts image of parts dictionary creation time and 
performing collating with said face collating drctionary and input image Based on a brightness 
frequency dictionary for a parts extract, and a parts dictionary, a parts image of an input image to a 
specific region is extracted from an image for recognition extracted based on said brightness 
frequency dictionary for a face field extract. It is the image recognition method characterized by 
choosing physical relationship of an extracted parts image to an image for collating from an input 
image, and making it collate with said face collating dictionary. 

[0017] Moreover, in an image recognition method according to claim 10. in case this invention draws 
up said face collating dictionary A brightness frequency dictionary for face collating held in quest of 
brightness frequency of an image for recognition of a face collating dictionary is ******(ed). In case 
said face collating dictionary and an image for collating are collated, it is the image recognition 
method characterized by collating using a pixel which will seemingly be reasonable as an image for 
recognition based on said brightness frequency dictionary for face collating. 

[0018] Moreover, an image input means by which this invention inputs an image for recognition in 
image recognition equipment which recognizes an image for recognition, A focus extract means to 
extract the focus from a sample image inputted with this input means. A dictionary creation means 
to form a partial dictionary pattern showing characteristic quantity of a partial image near [ which 
was-extracted with this focus extract means / each ] the focus, and to draw up a partial dictionary 
of each focus, At the time of recognition of an image for recognition, said focus extract means 
extracts each focus from an input image inputted with said input means. It is image recognition 
equipment characterized by having a collating means to perform collating with characteristic 
quantity obtained from a partial image near [ each ] the focus, and a partial dictionary pattern of 
said partial dictionary. 

[001 9] Moreover, this invention is set to image recognition equipment of claim -1 3,-and said focus 
extract means is image recognition equipment characterized by extracting a point with a unique 
brightness change in a small field in an image for recognition, or a point with a unique geometry 
change of brightness as the focus. 

[0020] Moreover, this invention is image recognition equipment characterized by holding only a 
partial dictionary pattern which evaluated redundancy about two or more partial dictionary patterns 
obtained from a sample image, and was made into non-redundancy at the time of dictionary 
construction by said dictionary creation means in a partial dictionary in image recognition equipment 
of claim 13. . - - . . . : . . . . - . - _ . . . - ^ . 

[0021] Moreover, in image-recognition equipment of claim 13. at the time of dictionary construction 
by said dictionary creation means, this invention do not hold two or more partial dictionary pattern 

**** obtained from, a sample .image,, but hold a partial dictionary. pattern in which characteristic - - 

quantity be show for every predetermined gap, and be image-recognition equipment characterize by 
complement characteristic quantity using an approximation function complement about the section 
in the meantime. 

[0022] Moreover, it is image recognition equipment characterized by collating by this invention's 
creating and holding a contribution coefficient which contributes to recognition according to a use 
to each partial dictionary pattern in image recognition equipment of claim 13, and said collating 
means considering a contribution coefficient. 

[0023] Moreover, this invention is image-recognition equipment characterized by to use a set of a 
partial dictionary pattern effective in recognition on the occasion of collating by said collating 
means, creating and holding a contribution coefficient which contributes to recognition according to 



a use to each partial dictionary pattern, and choosing from distribution of a contribution coefficient 
according to a use of recognition in the image-recognition equipment of claim 13. 
[0024] Moreover, this invention has a brightness frequency dictionary held in quest of brightness 
frequency of each pixel in an image for recognition based on dictionary data in image recognition 
equipment of claim 13. While collating an input image and a dictionary drawn up with said creation 
means at the time of recognition of an image It is image recognition equipment characterized by 
having having a collating means which collates by considering reliability about each pixel of an input 
image based on brightness of each pixel of said brightness frequency dictionary and input image in 
the case of this collating. 

[0025] Moreover, this invention is set to an image recognition method of recognizing an inputted 
image for recognition. Extract the focus from an inputted sample image, form a partial dictionary 
pattern showing characteristic quantity of a partial image near [ this / that was extracted / each ] 
the focus, and a partial dictionary of each focus is drawn up. An image recognition method 
characterized by performing collating with characteristic quantity which extracted said focus from 
an inputted input image at the time of recognition of an image for recognition, and was obtained 
from a partial image near [ each ] the focus, and a partial dictionary pattern of said partial 
dictionary. 

[0026] Moreover, this invention is the image recognition method characterized by setting to said 
focus extract and extracting a point with a unique brightness change in a small field in an image for 
recognition, or a point with a unique geometry change of brightness as the focus in an image 
recognition method of claim 20. 

[0027] Moreover, this invention is the image recognition method characterized by holding only a 
partial dictionary pattern which evaluated redundancy about two or more partial dictionary patterns 
obtained from a sample image by said dictionary creation time, and was made into non-redundancy 
in a partial dictionary in an image recognition method of claim 20. 

[0028] Moreover, in an image recognition method of claim 20, this invention does not hold two or 
more partial dictionary pattern **** obtained from a sample image to said dictionary creation time, 
but holds to it a partial dictionary pattern in which characteristic quantity is shown for every 
predetermined gap, and is the image recognition method characterized by complementing 
characteristic quantity using an approximation function complemented about the section in the 
meantime. 

[0029] Moreover, it is the image recognition method characterized by collating by this invention's 
creating and holding_a contribution coefficient which contributes.to recognition according-to a use- 
to each partial dictionary pattern in an image recognition method of claim 20, and said collating 
means considering a contribution coefficient. 

[0030] Moreover, this invention is the image-recognition method characterized by to use a set of a 
partial dictionary pattern effective in recognition on the occasion of collating by said collating 
means, creating and holding a contribution coefficient which contributes to recognition according to 
a use to each partial dictionary pattern, and choosing from distribution of a contribution coefficient 
according to a use of recognition in an image recognition method of claim 20. 
[0031] Moreover, while this invention draws up a brightness frequency dictionary held in quest of 
brightness frequency of each pixel in an image for recognition based on dictionary data in an image 
recognition method of claim 20 and collating an input image and said dictionary at the time of 

recognition of an.image It is the image. recognition method characterized. by having collating-by. 

considering reliability about each pixel of an input image based on brightness of each pixel of said 
brightness frequency dictionary and input image in the case of this collating. 
[0032] Moreover, a taking-in production process which captures the target image with which this 
invention contains a person's face image, A focus detection production process of detecting the 
focus about an object image captured at the aforementioned taking~in production process based on 
the singular point of distribution of brightness in a predetermined field of this object image, A 
contribution coefficient production process of computing two or more contribution coefficient which 
shows effectiveness for collating about two or more set semi- data of the focus for recognition 
stored beforehand, respectively, and storing it. Said criteria data deletion production process of 
deleting two or more redundant criteria data in consideration of the redundancy of criteria data. The 



remaining two or more set semi- data at the time of redundant criteria data being deleted at said 
criteria data deletion production process, It is the image recognition method characterized by having 
a collating production process which compares the focus detected at said focus detection 
production process for said two or more unit fields of every based on said contribution coefficient 
stored at said contribution coefficient production process, and collates said object image. 
[0033] 

[Embodiment of the Invention] Hereafter, this invention is explained to details using a drawing. A 
person's face field included in the image inputted from the video camera using this method is 
started in below, the posture is presumed to it, and two operation gestalten of the equipment for 
performing collating with the person in an image and the person by whom dictionary registration was 
done are shown in it. 

[0034] (1) The outline processing explanatory drawing 1 of the 1st operation gestalt is a block 
diagram of the operation gestalt of this equipment. The image storage section 2 by which this 
person face recognition equipment is connected with the image input section 1 in drawing 1 at this, 
The user control unit 13 connected to this, and the dictionary creation section 14 connected to this. 
The brightness frequency dictionary 7 for a face field extract connected to this, the brightness 
frequency dictionary 8 for a face parts extract, the brightness frequency dictionary 9 for collating, 
the face field extract section 3, the face logging dictionary 10, the face parts extract section 4, the 
face parts dictionary 11, and the collating fixing section 5, It has the collating section 6, the face 
collating dictionary. 12 connected to this collating section 6, and the display 15 connected to the 
collating section 6. 

[0035] In order to perform actuation of above-mentioned face logging, posture presumption, and 
face collating, the face parts dictionaries 11, such as the ****** dictionary 10 drawn up from the 
face image sample in two or more person object under two or more photography environment, an 
eye, and opening, are used for this person face recognition equipment. 

[0036] Drawing 2 is the flovy chart of the outline processing concerning the 1st operation gestalt 
concerning this invention, and explains actuation hereafter using drawing 2 . The system of this 
operation gestalt stores in the image storage section 2 once the digital image data inputted every 
several frames by the image input section 1 which consists of monochrome, or the television 
camera and A/D converter of a color (S11). And in the face field extract section 3, the brightness 
reliability for a face field extract is calculated about each pixel based on the brightness frequency 
dictionary 7 for the face field extract sections (SI 3), and a face field is extracted from this 

brightness reliability for a face field_extract,_and the_**** delivery .volume_dictionary J 0_(S1.5) 

[0037] Furthermore, in the face parts extract section 4, the brightness reliability for face parts 
inside a face field is calculated (SI 7), and an eye and opening are extracted from this brightness 
reliability and a face parts dictionary (SI 9). Furthermore, in the collating fixing section 5. the image 
for collating is generated from the location of a face parts field, and an input image (S21). And finally 
collating with the person in an input image and the registration person in a dictionary is performed 
by what (S23) pattern matching of the image for collating and a registered dictionary is computed 
for according to the reliability dictionary for collating in the collating section 6. 
[0038] that is, although pattern matching of an input image and a face dictionary perform , it be 
match focusing on that whose reliability of distribution of a brightness be a constant level among 
the dictionary data of a face image , and the person recognition equipment which secure fixed 
recognition precision.quick.moreover realize.using .the brightness reliability-which.be the feature of _ . 
the 1st operation gestalt of this invention in that case in the face field extract section 3 , the face 
parts extract section 4 , and the collating section 6 . 

[0039] In addition, when the input from a monochrome camera is targetted by the image input 
section 1 in the above, the image storage section 2 is accumulated as a gray shade image of 8bits, 
and when aimed at a color picture, the color component of an input image (RGB) is changed and 
accumulated in the brightness component Y. Generally the transformation at this time is not used 
and does not interfere [ following ]. 
[0040] 

Y=0.30, R+0.59. G+0.11, andB (1) Although the above is the outline of the 1st operation gestalt, 
processing of the face field extract section 3, the face parts extract section 4, the collating fixing 



section 5, and the collating section 6 is explained below at details. 

[0041] At the beginning of creation **** efface logging, a collating dictionary, and a face parts 
dictionary (a) Two or more photography environment. Collect the sample image data of the face of 
two or more person object, and the field where a face is contained from these is started by the 
manual by the user control unit 13 which consists of a mouse etc. After setting to li (i= 1, — P, and 
P:all measurement size) the image vector checked by the display 15. normalizing these each in the 
fixed size N and normalizing the lighting fluctuation in each image, principal component analysis is 
carried out. the K-dimensional (k<N) high order of characteristic value is used, and a face pattern is 
created, the characteristic vector (N dimension) according to the above-mentioned principal 
component analysis as a dictionary generated at this time — characteristic value — size — it 
becomes what was collected k pieces from the direction. 

[0042] It is [i so that the size of a normalization image may become fixed as normalization about 
size here. It is good as what was thinned out at a fixed rate. Moreover, as normalization to lighting 
fluctuation, "normalization by the image magnitude of a vector" (each component of a vector is 
broken by the norm of a vector) stated on the "conventional trouble" of this specification, 
"histogram flattening" [1], "the plane reliance panel of the brightness in a screen" [1], etc. may be 
used. 

[0043] Use as the dictionary 12 for face collating what was created as sample image data for 
dictionary creation above only using the data of every two or more photography environment and 
each registrant j (however, j= 1, — Pj;sigmaPj =P), and let what was created using all of two. or more 
photography environment and two or more persons' data be the face logging dictionary 10. 
[0044] the face parts dictionary 11 — ** — although it is made like and created, only the field of an 
eye and opening is used, not using the whole face as a sample image. 

(b) Explain a face field extract, a face parts extract, next detection of the face location in the input 
screen performed in the face field extract section 3 according to the flow chart shown in drawing 2 . 
In the face field extract section 3, detection of a face location is performed about the image 
(subject copy) temporarily accumulated in the image storage section 2, - 
[0045] The luminance distribution for a face field extract is acquired from introduction and the 
brightness frequency dictionary 7 (S31). And. the contraction image which reduced the subject copy 
to a certain resolution is created (S33). Next, brightness reliability for a face field extract for every 
screen of a contraction image is computed (S35). (updating) And the raster scan of the inside of this 
contraction image is carried out. a partial image is cut down according to the feature-vector size 

- accumulated into the above-mentioned face logging dictionary. 1 1, and matching between feature 
vectors is taken (S37). furthermore — and when two or more reduction percentage is changed, let a 
location when evaluation of the above-mentioned matching is the best be a face field (S39, S41). 
[0046] As the matching technique, it does not interfere using the method of detecting so-called 
inner product max, or detecting so-called distance (for example, Euclidean distance, Mahalanobis 
******, etc.) min about the feature vector in a dictionary, and a partial image vector so that 
generally as the pattern recognition technique of an alphabetic character or voice. 
[0047] By the method specifically according to an inner product 1/|V| sigma (V, Dk) (2) k= 1, K 
however V:logging partial image-size normalization Image vector, Dk: It is the i-lh characteristic 
vector of a face logging dictionary, and |Vl is the norm of V. Although (V, Dk) are the inner products 
of V and Dk, if both number of dimension is made into M dimensions (V, Dk) =sigma v (i) x dk (i) (3) It 

-is set to i=l and M. However, v (i) and dk (i) are the-i^h element .of_each_ vector. _ 

[0048] Moreover, by the method by distance, above-mentioned V and the central value D1 of a 
dictionary sample vector, for example, the above, are projected on Dk (k= 1, — K), the projection 
vectors PV and PD1 are created, respectively, and those Euclidean distances 1 are found. 
[0049] Here PV (k) = (V, Dk) (3) It is ' however k= 1. — K, and is PD1 (k) = (D1. Dk). 
[0050] Although it is equivalent to each pixel of the image which normalized the partial image with 
which each element of V was cut down out of the input image in fixed size, and was created in 
formula (3) and formula (3) ' above, weighting is perform using the reliability judged for every pixel 
from the below-mentioned brightness frequency dictionary about these, and the point of this 
operation gestalt matches only using the pixel which will seemingly be reasonable as brightness. 
Therefore, (3) and (3) ' is as follows. 



[0051] 

(V. Dk) =PV(k) =sigma v(i) xT(i) xdk (i) 

(3) " — i= 1 and M — T (i) is the above-mentioned reliability here. Whenever each resolution image 
is generated above, brightness reliability is also updated by the below-mentioned method. 
[0052] the interior of the face field detected in the above-mentioned face field extract section 3 in 
the input image although an eye and opening were extracted as parts in the face parts extract 
section 4 — receiving — this and ** — pattern matching [ like ] is performed using the face parts 
dictionary 11^ Drawing 4 is a flow chart which shows extract processing of face parts. In drawing 4 , 
one kind in a left eye, a right eye, and opening is set up as attention parts (S51). At this time, 
reliability is computed for every parts called an eye and opening using the separate brightness 
frequency for attention parts (S53). And attention parts are extracted from brightness reliability and 
an attention parts dictionary inside a face field (S55). Thus, attention parts are changed one after 
another and detection of attention parts is continued until it extracts all parts (S57) (S59). 
[0053] (c) Describe creation of a brightness frequency dictionary, a reliability judging, next creation 
and the judgment of reliability of a brightness frequency dictionary (7, 8, 9) below. 
[0054] As shown in the above-mentioned (a), the sample images about two or more registrants in 
two or more photography environment are collected, and the image which normalized in the fixed 
size N about the image li (i= 1, — P, Pimeasurement size) cut down as a face field is set to Ji (i= 1, - 
-P, Pimeasurement size). 

[0055] First, each Ji The-frequency distribution h of the brightness to all samples (x y, I) is created 
to each pixel (x y) of every, and it considers as a brightness frequency dictionary (7, 8, 9). And at 
the time of recognition, reliability w (x y) is computed from the correspondence pixel I (x y) of h (x y, 
I) and an input partial image. As a brightness frequency dictionary, a thing separate as 7 for a face 
extract, 8 for a face parts extract, and 9 for face collating is prepared as follows here. 
[0056] Although various things as an algorithm of the above reliability can be considered, reliability 
T (x y) may be determined as follows, for example from average Mh and standard deviation mho h of 

luminance distribution h (x y, I). - - , . 

[0057] 

T(x y) = [1/sigmah (x y. I)} X h (x y, I) 

if |l-Mh |<alpha x mho I=Imin, — Imax = 0 else (4) It corrects, alpha is the constant of the arbitration 
for determining the range of reliability above, and they are Imax and Imin. It is the maximum and the 
minimum value of a brightness value of an observation image. 

[0058] Moreover, the technique by a-posteriori. probability conversion as-shown in reference-[2]-as - 
the another technique may be used. Here, the thing and the event theta 1 which an event theta 0 is 
reliable shall not be reliable. When the a-priori probability of thetaO and thetal before observing the 
brightness of the pixel in a certain partial image is wO and w1, respectively (however, wO+w1=1), a- 
posteriori probability wV — conditional [ of the pixel brightness I under event thetak ] — the time 
of setting probability to P (l|thetak) — from Bayes' theorem w1 -{w1 p (l|theta 1)}/{p(I IthetaO) (1- 
W1 )+p(l|theta 1) w1} (5) It becomes. This w1' is used as reliability. 

[0059] Here, it is from the frequency distribution h of the logging image efface dictionary creation 
time (x y, I). p(l|theta 0) = {l/sigmah (x y, I)} x h (x y, I) (6) In order that it may be I=Imin and ^Imax 
and p dltheta 1) generally may not have a data sample p(l|theta 1) = 1/(Imax-Imin) (7) It presumes. 
Moreover, the above updates with w1=wV, whenever an input image is updated, and it is good as 

initial value =1.0/2.0-of-w1 - _ 

[0060] (d) In the decision collating fixing section 5 of a collating location From the relation between 
the face field called for in the face field extract section 3, and the face parts called for in the face 
parts extract section 4 The quadrilateral field which the eye of a face and the edge of opening 
connect as a field with little aging as people's face is chosen, and the image for collating which 
deformed using the geometric conversion which this indicates is in agreement with the location of 
the face parts in the collating dictionary 1 2 to affine transformation or [3] and which starts and 
consists of a field is created. 

[0061] (e) Use the flow chart of drawing 5 for the collating last, and explain the procedure of 
collating to it Although pattern matching is performed by the technique same about the image for 
collating created by the collating fixing section 5 as the collating dictionary 12 and (b) having shown. 



in this case, it is every registrant's j dictionary Dj in the collating dictionary 12 as a candidate for 
collating. It becomes. 

[0062] That is, Individuals i thing Di is set up as an attention registered person's dictionary (S61). 
And an attention person's brightness reliability is calculated (S63). Furthermore, similarity (distance) 
with an attention person is computed from illuminance reliability and Di inside a collating field (S65). 
Also in here, matching based on (3) " types using the reliability judged about each pixel within a 
logging image is performed. Here, they are an image for collating, and Dictionary Dj. In matching, 
since it differs for every individual dictionary, brightness frequency distribution may use different 
distribution for every dictionary for collating, may create one frequency distribution of the whole 
registrant, and may use only this. 

[0063] Furthermore, collating is completed with outputting ID of the person to whom this was 
carried out until it collated this matching with all registrants (S67, S71), and optimal matching was 
carried out (S69). 

[0064] Furthermore, although it is how to treat the reliability in the case of this matching, that at 
least two method is considered. According to the degree of reliability, one makes this value a 
coefficient and it matches according to this magnitude. That is, about the unit field (mesh) where 
reliability is high, matching near 100% is performed with a subject-copy image in dictionary data. On 
the other hand, the degree of matching with a low degree is reduced about the unit field (mesh) 
where reliability is low. By carrying out like this, the optimal matching according to the degree of 

brightness is realizable. 

[0065] Another prepares the threshold about reliability, and if this reliability is a value below 
constant value, it will not perform matching with a subject-copy image about the unit field (mesh) of 
the dictionary data about this reliability. With constant value [ more than ], the unit field (mesh) of 
this dictionary data performs matching with a subject-copy image. 

[0066] (2) Use a drawing for below and explain the 2nd operation gestalt of outline processing 
explanation of the 2nd operation gestalt, next this invention to it at details. 

[0067] The feature of the 1st operation gestalt searches for the reliability about the brightness of 
dictionary data, it is treating dictionary data according to this reliability, and without dropping an 
essential discernment precision, is reducing the frequency of matching and raises the speed of 
recognition processing. 

[0068] The so-called contribution coefficient which, on the other hand, expresses the degree which 
contributes to discernment processing among each unit field (mesh) of dictionary data as the 2nd 

point of-this-invention in the-2nd-operation-gestalt is calculated, and this is-made to correspond to - 

each unit field of each dictionary data, and is made to memorize beforehand. And about the unit 
field where a contribution coefficient is low, although it corresponds to this other than the face field 
on a screen (for example, the image of a background), in the case of matching with a subject-copy 
image, as this does not consider as the object of matching, it shortens the time amount concerning 
matching, and attains speeding up of recognition processing, for example. 

[0069] Drawing 6 is the block diagram of the 2nd operation gestalt. The image storage section 2 by 
which this person face recognition equipment is connected with the image input section 1 in drawing 
6 at this. The focus extract section 18 connected to this, and the dictionary creation section 19 
connected to this. The dictionary reevaluation section 20 connected to this, and the partial feature 
dictionary 23 connected to this, the face field extract section 21 connected to this, and the face 
posture presumption section 22 connected to this. The collating section 6 connected to this, and 
the contribution coefficient 24 for a face field extract connected to the dictionary creation section 
1 9, the contribution coefficient 25 for face posture presumption and the contribution coefficient 26 
for collating, Furthermore, it has the brightness frequency dictionary 27 for a face field extract 
connected to the dictionary creation section 19, the brightness frequency dictionary 28 for face 
posture presumption, the brightness frequency dictionary 29 for collating, the display 31 connected 
to the collating section 60, and the user control unit 30 connected to this, respectively. 
[0070] In such structure, the contribution coefficient the person face recognition equipment of the 
2nd operation gestalt indicates the recognition effectiveness of each projection data in a dictionary 
to be for every use of a face field extract, face posture presumption, and collating is created by the 
dictionary creation section 19. Furthermore, speeding up of collating processing is attained, the unit 



field (mesh) of dictionary data not being used for matching for discernment, or not being used, and 
holding a substantial discernment precision according to the degree of this contribution coefficient 
[0071] Hereafter, actuation of processing of the 2nd operation gestalt and the point of other this 
inventions are explained using the flow chart of drawing 7 and drawing 8 . The system of this 
operation gestalt stores in the image storage section 17 once the digital image data inputted every 
several frames by the image input section 16 which consists of monochrome, or the television 
camera and A/D converter of a color (S71), cuts down the image of the perimeter of the focus 
obtained with the focus extract means 18, and draws up the above-mentioned dictionaries 23, 27, 
28, and 29 with the dictionary creation means 1 9. 

[0072] First, the digital image inputted by the image input section 16 every several frames is 
inputted, and it stores in the image storage section 17. Furthermore, as the 3rd point of this 
invention, in order to perform actuation of face logging, posture presumption, and face collating, the 
focus extract section 18 extracts the focus effective in recognition from the face image sample in 
two or more person object under two or more photography environment efficiently (S73). And in the 
dictionary creation section 19, it has a small field near the focus as an aperture, and the subspace 
which carried out principal component analysis of these apertures image, and the projection vector 
which projected each aperture to the space are created as a dictionary 24 (S75). according to this 
method — an input image — also receiving — ** — a part efface field according to partial lighting 
fluctuation and other bodies of the face image at the time of recognition by calculating the 
projection to said subspace of the image [ like ] in window region, and using matching with them and 
the projection in a dictionary — it also receives hiding and generating of incorrect recognition can 
be controlled. 

[0073] Furthermore, as the 4th point of this invention, evaluation and compression of the 
redundancy of the projection data in a dictionary are performed in the dictionary reevaluation 
section 20. That is, when projection data draws the same pattern as a repeat and exists in 
dictionary data, in case matching with a subject-copy image is performed, by deleting data which 
does not not much have comparative-semantics, and data with high redundancy, dictionary data is 
reduced and, thereby, matching speed is raised (S77). 

[0074] Next, the approximation function expression of the image projection fluctuation inside each 
sample raised as the 5th point of this invention is performed, and this performs efficient reduction of 
dictionary data (S79). That is, when dictionary data can express with an approximation function, the 
amount of data can be sharply reduced by expressing this with an approximation function. Thereby, 

matching speed can .be-raised. very much. _ . _ 

[0075] Such an image sample is classified according to a manual for every use (S81). And in the 
dictionary creation section 19, the contribution coefficient which is the feature as the 2nd point of 
this invention raised first is computed. That is, the contribution coefficient showing the degree 
which participates in discernment processing among each unit field (mesh) of dictionary data is 
calculated, and this is made to correspond to each unit field of each dictionary data, and is made to 
memorize beforehand (S83). And about the unit field where a contribution coefficient is low. 
although it corresponds to this other than the face field on a screen (for example, the image of a 
background), in the case of-matching with a subject-copy image,, as this does not consider as the - 
object of matching, it shortens the time amount concerning matching, and attains speeding up of 
recognition processing, for example. 

[0076] And .improvement Jn.processing speed is aimed at by choosing from lhe.statistical 

distribution over all dictionary samples only what has a high contribution coefficient as the 6th. point 

of this invention more than a threshold, and using it for matching (S85). 

[0077] Next, from an input image, in the face field presumption section 21, the face posture 

presumption section 22, and the collating section 23, a face exists where in a screen and it asks for 

whose face it is with what kind of sense from the result of matching with the partial image of the 

perimeter of the focus extracted in the focus extract section 18, and the projection data in a 

dictionary. Matching which carried out with [ which followed the contribution coefficient in the 

above-mentioned dictionary in these the processings of each ] weight is used. 

[0078] moreover, the above matching — setting — further — as the 7th point of this invention — 

the 1st operation gestalt and ** — technique [ like ] is applied, brightness frequency is computed 



by the dictionary creation section 19. and it memorizes in the brightness frequency dictionary (27, 
28, 29). And by computing and using the reliability based on brightness frequency distribution, 
matching by which dictionary data was refined very much is realized, and only recognition speed will 
improve, with recognition precision held. 

[0079] in addition, the 1 st operation gestalt and ** — like, when the input from a monochrome 
camera is targetted by the image input section 1, the image storage section 2 is accumulated as a 
gray shade image of 8bits, and when aimed at a color picture, the color component of an input image 
(RGB) is changed and accumulated in the brightness component Y according to (1) type. 
[0080] Next, drawing 8 is a flow chart explaining the actuation at the time of recognizing a face 
image, memorizes first the face image which is a very [ recognition ] image in the image storage 
section 17 by the input section 16 (S91), then, extracts the focus in an input face image in the 
focus extract section 18. and cuts down the partial image of the perimeter (S93). Furthermore, 
matching between the partial images and dictionary data which were cut down in the face field 
extract section is performed, and a face field is presumed (S95). In this case, the contribution 
coefficient 24 for a face field extract and the brightness frequency based on the brightness 
frequency dictionary 27 for a face field extract are used together, and a face field is presumed. 
Furthermore, matching with the partial image and dictionary 23 based on the coefficient 25 for face 
posture presumption and the brightness reliability computed based on the brightness frequency 
dictionary 28 for face posture presumption is performed, and a face posture is presumed (S97). 
[0081] Furthermore in the collating section 60, matching with a partial image and a dictionary 23 is 
performed based on the contribution coefficient 26 for collating, and the brightness reliability 
computed based on the brightness frequency dictionary for collating, and it is judged who the face 
image was (S99). 

[0082] Next, the details of "an extract of- the focus", "construction of a dictionary and 
compression", and the "matching method" are explained among the processings mentioned above. 
In the above, the outline of processing actuation of the 2nd operation gestalt was explained. 
[0083] (a) By extracting the feature portion of an image which are the focus extract from-an input - 
image and also the 3rd point of this invention explains below the production process which performs 
image recognition at details. The focus extract section 18 extracts the" feature effective in 
recognition from the inside of an image by the following methods as the 3rd point of this invention. 
[0084] In case matching of an input image and a dictionary image, continuation inter-frame 
matching, etc. are performed, it is common to choose a certain field and to make the perimeter into 
a template. Although it is for one improvement jn the speed of this reason, another is .for choosing a 
point, i.e., other points and the point that distinction is attached, more effective in matching (based 
mainly on distance or correlation) compared with the case where all the pixels equivalent to an 
object are made into a trace processing object. The edge in an image is well known as such a thing. 
[0085] such [ here ] a texture — in order to search for the unique effective feature, the effective 
focus extract technique of [4] is used, and it accelerates with pretreating to this. About the image 
for dictionary creation, or the whole input image surface for recognition, when processing [4], it is 
necessary to calculate the characteristic value of 2x2 matrix about each pixel, and processing cost 
starts. Then^ beforehand, all the textures in a processing field extract a corner point by the below- 
mentioned method as a characteristic point geometrically, and limit an effective focus extract field 
only near the corner point. 

[0086] The production process, of the following processings is explained based Qn, drawing 9. ..The 

corner appearance in here evaluates the edge it is supposed that it is good characteristic quantity 
experiential ly in the mask size of 5x5 pixels, and judges a corner point. 
[0087] First, if an image is inputted by the image input section 16 (SI 01), sobel [5] will be 
beforehand applied to a subject copy, and discriminant analysis binary-ization [6] will be performed 
(SI 03). And about each pixel in this image, when an attention pixel is 1, convolution with the mask 
of drawing 10 (a) is taken. And with the value of the pixel value p of a result, it judges whether an 
observing point is close to "+" mold branch point, or it is close to "x" mold branch point, here — as 
follows — a case — having divided . 
[0088] 

the p>=2x maximum pixel value -> "+" mold branch point The p<=-2x maximum pixel value -> "x" 



mold branch point Except the above -> the degree which is not a corner candidate — " — the case 
where it is based on the mask of drawing 1 0 (b) about +" mold and each "x" mold — dividing — 
coming out — the above and ** — it is made like and a corner point is searched for (S namely, the 
time of a mold — as follows — a part for a case — an opium poppy — if it is not a ~> corner 
point — a case — dividing — carrying out — . The - maximum pixel value The <=p< maximum pixel 
value -> Corner point Except the above -> Case of "x" mold branch point which is not a corner 
point The - maximum pixel value The <=p< maximum pixel value -> Comer point Except 
[0089] According to a degree type, the focus is Judged about the inside of the perimeter field W of 
the observing point p (x y) finally judged to be a corner (SI 07). It is Gradient of the following first, 
matrix It asks for G. 
[0090] 

G=sigma(gx* gx gx* gy) = (a b) (8) W (gy* gx gy* gy) (c d) 

Next, it asks for characteristic value lambda+ of G, and lambda-, and let the larger one of them be 
the pixel value of an observing point. A certain threshold lambdat after following all the pixels of a 
processing field in the above-mentioned processing Lambda>lambdat (9) A part for the becoming 
pixel value part is extracted as the effective focus. That for which it asked by discriminant analysis 
about each pixel value inside a processing field is used for the threshold at this time. 
[0091] (b) In construction of a dictionary and the compression dictionary creation section 19 About 
each face image sample, the partial image inside the field of the perimeter WxH pixel of the 
recognition effective focus extracted by the method of of the foregoing paragraph (2) and (a) is cut 
down. Principal component analysis of them is carried out, and the projection of a fractionation 
image vector is all searched for about the matrix (dictionary space matrix) which asked even for the 
Kth characteristic vector of them, and all the face samples to the space, and it considers as a 
dictionary, 

[0092] However, since the above-mentioned dictionary becomes huge when there are many 
measurement sizes, dictionary capacity is reduced according to the 4th point of above-mentioned 
this invention, and the 5th point. First, with the method as the 4th point of this invention, it is the - 
projection Pi to the dictionary space of each partial image Wi (i= 1, — NI) inside certain sample 
image 1. It asks and they are its projection among the partial images in the other sample image J (J! 
=1), and Pi. Distance is min so-called [ wj ]. The distance Pj It asks. By this, it is wj. Since the 
sample image which belongs is called for, it is the partial image Wi in Image I. It matches and also 
the frequency distribution of a sample image is acquired, this time — frequency distribution — max 

— other sample image Jmax It is defined as having matched Image 1. Moreover, I and Umax at this 
time Distance Dij is defined below. 

[0093] 

Dij=1/j sigma |Pi-Pj | (10) j= 1, — K, however Pj : A thing, K from which a partial image and projection 
distance in Image I serve as min among the partial images in Image J: It is the number. Moreover, 
Euclidean distance and absolute value distance may be used as a distance. 

[0094] The example of the table of the result of having calculated the above about all samples is 
shown in drawing 1 1 . The frequency distribution of such distance (D12-Dnn-1 in drawing) is 
created, and with the application of a discriminant analysis method [6], a threshold is calculated so - 
that the variance ratio of frequency distribution may be made into max. At this time, that projection 
is excepted from a dictionary about the partial image in the sample image whose distance was 
nearer than the threshold. _ 

[0095] Moreover, by the method as the 5th point of this invention, it shall apply, after performing 
dictionary reduction per sample image by the method of the 3rd term. First, the face image sample 
for dictionary creation is sorted about a certain individual in order of a certain parameter, for 
example, lighting, and fluctuation of a direction. If it does in this way, it will become general and the 
projection of the partial image in between [ which suits ] sample images will continue. 
[0096] a part for next, all of above-mentioned sample images — an image — jMc****** — although 
the projection distance between each partial image is found — if — a certain partial image wi from 

— partial image wj of others when it sees the case where the minimum distance is suited — 
coincidence — wj from — a minimum distance partial image — wi it is — sometimes, it is 
considered that both are resemblance. And when the partial images wil, wi2, and wi3 in adjoining 



sample image sequences and — win are similar, an approximation function expresses these 
collectively and dictionary size is reduced. Although a ^Me** thing [ like ] can be considered as an 
approximation function, the cubic spline used for [7] etc. may be used. 

[0097] Furthermore, as the 2nd point of this invention, according to the use of face extract posture 
presumption and collating, the contribution coefficient for effective recognition is created from a 
dictionary sample, and adjustment of the method of effect in the recognition result of a partial 
dictionary pattern is enabled for every use about each partial dictionary pattern in the dictionary 
creation section 19 at the time of recognition. The example of the calculation method of a 
contribution coefficient is raised to below. 

[0098] It asks for the above-mentioned focus image about two or more samples first arranged for 
every use. In order that the luminance distribution of this image may show singularity within a 
sample image, for the optimal recognition, there should be recognition contribution for that use in 
that partial-near the focus image highly. Then, principal component analysis of said focus image is 
carried out for every use sample, and the characteristic value Pi of K high orders and a 
characteristic vector Vi (i= 1 — n) are extracted. sigmaPi x Vi (1 1) The weight image for every use 
is created by i= 1 and k. 

[0099] Each partial image wi at the time of putting each sample image I on the above-mentioned 
weight image finally, and making it meet The contribution coefficient C wi for every partial image is 
computed as follows by the total S of the weight pixel value in a field. 
[0100] 

Cwi=S/Pmax (12) 

It is Pmax here. It is the maximum pixel value in the above-mentioned weight image. In addition, it is 
made to be the following as a sample for every use, for example. 

[0101] [n the case- of face field presumption : Thing face posture presumption which set the value 
outside a face field to 0 by the feature image of each sample image : Face collating same as the 
above : what created the feature image about the sample image set for every individual — an 
individual contribution coefficient by creation and the above for every individual dictioriary.in this . _ 
case Although effect of the recognition processing on a partial image dictionary with high 
redundancy is lessened if possible and the purpose improves precision with the contribution 
coefficient for every use according to a use In order to prevent the processing-time increase by 
matching in a partial image dictionary with a still lower contribution, as already stated as the 6th 
point of this invention, a threshold is statistically determined from distribution of these coefficients;*- 
and if the partial image dictionary of the contribution coefficient below the threshold is not used, it 
is good, for this reason — contribution coefficient C wi which took the frequency distribution of the 
contribution coefficient called for for every above-mentioned partial image dictionary, calculated the 
threshold CT by the discriminant analysis method [6], and was given for every partial image 
dictionary CwKCT (13) — a partial image dictionary is not used for matching. 
[0102] (c) Match in the following procedures about the inside of an input image in the matching 
method face field presumption section 21, the face posture presumption section 22, and the 
collating section 23. 

[0103] the partial image dictionary which extracted the partial image from the inside of a Production - 
process Irinput image, and was registered into the dictionary about those each — it asks for so- 
called similarity min inside (matching). 

Production process 2: Ask for the thing I which has the highest frequency of occurrence, and. the 
frequency H at that time among the images for samples with which the above-mentioned matching 
partial dictionary pattern is contained. 

[0104] Production process 3: Ask for reliability T= (the total number of partial windows in maximum 
frequency-of-occurrence H / input image). 

the partial window equivalent to the Production process 4:above I — the shift of a dictionary to a 
coordinate — respectively — asking — the frequency of the vote operation to a shift amount — 
max — a window is chosen and it asks for the affine transformation from them and a dictionary. 
[0105] In the above, the collating section 23 performs processing of a production process 1, a 
production process 2, and a production process 3, and the face posture presumption section 22 
performs a production process 1 - a production process 4. Finally, following a production process 1 



- a production process 4. to the face image field in the sample image I, above-mentioned affine 
transformation is performed and the face field within an input image is presumed in the face field 
presumption section 21. 

[0106] In addition, I in the above is made into estimate and it rejects with assessed reliability T and . 
an affine transformation coefficient, the contribution coefficient C for every use stated in all knots 
in the production process 1 above — using — (3) " and ** — like — partial image li Projection Pi 
to dictionary space If it asks for (k), matching in consideration of a contribution coefficient can be 
performed. 
[0107] 

Pi (k) =sigma I 0) x C (i) x dk Q) (14) For j= 1, M. however M, the number of dimension of a partial 
image vector and C (i) are the partial image li. A contribution coefficient and dk (j) are the No, J 
elements of the k-th characteristic vector of dictionary space. 

[0108] It asks like, further — the reliability of the brightness as the 7th point of this invention — 
(1), (c), and ** — the time of recognition — an input image to the above, and if a partial image is 
cut down like, it asks like T (x y) of (4) types, or wV of (5) types from each coordinate in the field (x 
y) and it is set to T (i) Pi (k) =sigma lO) xC(i) xdkO) xT (i) (15) The robust recognition which took the 
reliability of brightness into consideration by carrying out like j= 1 and M in addition to the 
recognition contribution for every use is attained. 

[0109] As mentioned above, according to the person recognition equipment of this invention, the 
following effects can be considered, that is, although the image recognition method be propose 
as what do not use not much special equipment in that case although a user have the needs of the 
individual collating section with few burdens by non-contact , with equipment , there be a defect 
that exact recognition be difficult , according to an image recognition conventionally in the 
environment which do not have little lighting fluctuation in the field which went as the password of 
OA equipment , such as individual collating with social systems , such as close recession 
management . an ATM device , and a ticket machine , and a personal computer . On the other hand, 
according to this invention, the reliability of the luminance distribution.in a dictionary creation 
sample image is verified about each pixel brightness in an input image, and by using only a pixel 
reliable about brightness, or the pixel of the circumference of it for recognition, even if there is a big 
change of brightness, it makes it possible to perform stable recognition. 

[01 10] moreover, it does not consider as the image of the whole field which has a face as a face 
dictionary, but creates using the subregion near [ which is included in a face image ] the partial 
feature field (partial dictionary pattern), and those matching results are depended on some 
brightness fluctuation and other bodies by carrying out a comprehensive judgment — hiding — etc, 

— the fluctuation to depend is absorbable. 

[01 1 1] Under the present circumstances, it poses a problem that the data size of said partial 
dictionary pattern becomes large, and the equipment scale for realizing this technique becomes 
large. However, the redundancy for every dictionary creation sample image is evaluated statistically, 
a redundant pattern is removed or dictionary size can be made small by approximating and holding 
each partial dictionary pattern with a complement [ hot itself but ] function between samples which 
characteristic quantity change follows etc. 

[0112] Furthermore according to each processing called a face extract, posture presumption, and 
collating, it becomes possible by using it in consideration of contribution of the optimal partial 
pattern to improve the face recognition engine performance remarkably from both sides of the 
precision of recognition, and the processing time. 
[0113] In addition, bibliography is shown below. 

[1] KSung and T.Poggio. and Example-based Learning for ViewH3ased Human FaceDetection.In 
Proceedings from Image Understanding Workshop, pp.843-850, and November 1 994. [2] Nakai: The 
migration body detection technique and ******** using a-posteriori probability, 94-valve flow 
coefficient-90, pp.1 -8, and 1994[3] Yamaguchi, Fukui, Maeda, The face recognition system, Shingaku 
Giho using a dynamic image. PRMU 97-50, pp 17-23, [(1997) 4] Shi, Tomasi, Good Features to 
Track, and IEEE Conference on Computer Visionand Pattern Recognition(CVPR94) Seattle and June 
1994 [5] Tree, Shimoda, a "image-analysis handbook", University of Tokyo Press, and pp. — 553- 
554 and 1991 [6 — 1 Otsu — Distinction and the automatic threshold selecting method based on the 



minimum square criteria. ******. Vol.J63-D. No.4. pp.349-356, and 1980[6] Tree, Shimoda: Image- 
analysis handbook PP.578[7] Murase, NEFk^A, image spotting of the three-dimension body by 
"multiplex resolution and proper space expression, *5Mc3|c5Mc^ Vol.36 No.10. pp 2234-2243. Oct, 1995 
[0114] 

[Effect of the Invention] By as mentioned above, the thing for which according to this invention the 
reliability about brightness is searched for from the dictionary data for performing recognition 
processing of a face image, and this is associated and stored as explained to details Even if it uses 
or uses for matching processing with the subject-copy image from a camera to the unit field (mesh) 
of the dictionary data which seldom fits recognition when the lighting at the time of for example, 
dictionary data being created on the occasion of recognition processing is unsuitable, matching 
processing with a low degree is performed. Matching is ensured about the mesh which realizes 
positive recognition by this, and the mesh with low reliability can realize discernment processing of a 
quick person's face image as a whole by carrying out an abbreviation etc., holding discernment 
precision. 

[01 15] moreover — further — the unit field (mesh) of dictionary data — it is alike, respectively, and 
attaches, and a contribution coefficient is prepared from a viewpoint whether to contribute to 
matching processing however, respectively, and effective [ according to this contribution 
coefficient ] in the case of matching — ■ two or more meshes are alternatively matched as criteria. 
Thereby, discernment processing of a quick person's face image is realizable by reducing dictionary 
data, holding discernment precision. 



[Translation done.] 
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DESCRIPTION OF DRAWINGS 

[Brief Description of the Drawings] 

[Drawing 1] The whole block diagram concerning the 1st operation gestalt of this method. 
[Drawing 2] The flow chart of the outline processing concerning the 1st operation gestalt of this 
invention. 

[Drawing 3] The flow chart of the face field extract concerning the 1st operation gestalt of this 
invention. 

[Drawing 4] The flow chart of an extract of the face parts concerning the 1 st operation gestalt of 
this invention. 

[Drawing 5] The flow chart of the face collating processing concerning the 1st operation gestalt of 
this invention. 

[Drawing 6] The whole block diagram concerning the 2nd operation gestalt of this method. 
[Drawing 7] The flow chart of dictionary creation with the 2nd operation gestalt of this invention. 
[Drawing 8] The flow chart of recognition with the 2nd operation gestalt of this invention. 
[Drawing 9] The focus extract technique concerning the 2nd operation gestalt of this invention. 
[Drawing 10] The corner extract mask concerning the 2nd operation gestalt of this invention. 
[Drawing 11] The example of distance between the sample images concerning the 2nd operation 
gestalt of this invention. 
[Description of Notations] 

1 — Image input section 

2 — Image storage section 

3 — Face field extract section 

4 — Face parts extract section 

5 — Collating fixing section 

6 — Collating section 

7 — Brightness frequency dictionary for a face field extract 

8 — Brightness frequency dictionary for a face parts extract 

9 — Brightness frequency dictionary for collating 

10 — Face logging dictionary 

11 — Face parts dictionary 

12 — Face collating dictionary 

13 — User control unit 

14 — Dictionary 
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j = l,— Pj ; SPj =P) ©7^-^©^S:Mfflbr 

f^^ bfc t> o 1 2 i: b. SSc}i^3gi^. 41 

ifi^miXcDy'-i^^^xmm Lxf^mLrzi><Dt:mm^ 

1/1 V I S (V. Dk) 

k=l. K 

ifeeu V : WtJlUbgP^Bi^-y-'f XiEmii^^^ h 
M Dk:^tOt)mbSE»®^iH#'<^b;u-efet)^ ★ 

(V, Dk) =Z V (i) X 
i = l, Mh'feS, fc^U V (i) , dk (i) tt^ 

[0 0 4 8] *fei^lit:<tS:&ST•^i^ ±3£©Vi:i»» 



•^t)aib^»l Oi:1-So 
[0 0 4 4] ^y^-^yifsi i*)iiij;dtbt:f^j^^ix 

mmimm^3x^dxtimm[^<Dm^w<D^m 

So 

[0 0 4 5] )e«M«i^»7A>e.MM^aaiffl 
ojW!S[^?&?£mffr5 (S3 1) o ^bt. jiii^fefea 

«?^tSS/J>bfctS/Milfe*f^figt- S (S 3 3) c 
m (Hff) -r?) (S3 5) o ^bT> 

7;^^:;^^r-\'>b-c. wMmw^mLmmi i^jcm 

eiUWbs nm^i^ V}Vt(Dm(D-7^^^yy:^t?> (S 
3 7) o Mt> ^bTSI'J^^<fe1ti[g^fc3®-&{'*V^ 

(S3 9, S4 1) , 

[0 0 4 6] ^>y5^>^#^i:bT{is :i:*-^^^cD7^ 

^-ymM^mtLx-m&o^^oiz, mm\^wm^i7 
y-ji^t^^m^-^^ hji/tiz-ov^xcT}^ f^mMxtiii>i) 

&®fflbtMb^A^S:^/^. 
[0 0 4 7] a<*:WK{4rt«{c<fcS;&^t?B:. 



(2) 

★ I V I itYCD^'Jl^AXSb^. (V, Dk) {±Vi:Dk® 

dk (i) (3) 

(k=l. -K) tit^b-C^n^tlMK^i^ b;i,p 
V. P D 1 ^#fiKU ^tl^<D^-^ U «y Hgggl 1 4* 

[0 04 9] ClCltC 
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PV (k) = (V, Dk) 
tc1z\^. k=l, -Kffetjs PDl (k) = (Dl, 

Dk) T'$)Sc 

[0 0 5 0] a±T-iC (3) . (3) ' {ciJV^rVcD 
(V, Dk) =PV (k) =Z 

i = l, M 

[0 0 5 2] M>'^-^:'atBa5 4■c-^±>'1-')^ilU•ri^ □ 

aSI&l&^l-S (S5 1) , ^©ilt, ix □tV^^>'1 

MMS^sm-rs (s5 3) o ^b-c. ^M^rtscT'itf 

S (S5 5) o C:cDJ;d{3bT^-'t-'y*}ftai-rS*-r- 
(S5 7) s ^B-'^-»y;fe»;'^i:#^Tot, ^Bf^- 

"■va^m^i^m^-y^ (S5 9) , 
[0 0 5 3] ( c ) mm.m^nm(oimhmmmM^ 
ifetx (7, 8. 9) (Di'^^h^ommm^ 

T (x, y) = {l/2h (x. y 



14 



(3) 



* bfcffl^SiK*fflv^•rM^!^^^^^^t?fe^f v\ isKfc br** 

bV^li^©(j^^*^ffl bT X' ^><{^^ff 5 ®4s*|IM$ 
^CD;t?^>hT'^So bfcASot. (3) . (3) ' {± 

[0 0 5 1] 

V (i) XT (i) xdk (i) 

(3) " 

[0 0 5 4] miMcD (a) cdJ;^{c> 
ttS^|&e^#t-3V>-rcD1t>7';i/il^?fejK*bx MM 
^i:br«lbm$>i,fcliii^tli (i= l, -P, P : it 

vr^i/ijc) Kov^T-^-y-'fXNtiEmbfciffliii*j 

i (i= 1, -P, P : ■y->X;HS) i:-rs, 
[0 0 5 5] ^r. ^Ji CD^ia^ii^ (x, y) ^iiK 
±-l?->X;i/tM1-SffS0MS^*h (x, y, I) 

wwLh. mmmsL^m (v, s, 9) iit-s. ^brg 
im#ttth (x, y, I) }:.xtiw>^miMi(oniimmi 

» (x, y) tt'^mmmvf (x, y) €:»m-rso c:^: 

8, M!^-&ffl9fcbt:SiJ'*rcD*)0*JilTO<t5tJp{l-r 
5o 

[0 0 5 6] ±l3(Z)J;a*t{illKcD@^i: btttfil^JS: 
*CD*s%^c,tL54S. TztXiSimm'^^h (x, y, 
I) ®¥±9Mhfc:^Jti{g^ahi:A»t,M!i«T (x, 
y) *WTOJ;dt?*^bt^bm>. 
[0 0 5 7] 
. I) } X h (x, y, I) 
if I I-Mh \<a X a 



I = I min, — I max 
= 0 

tzii b±!B-C-a{4M««OlfiH&^f %fc«)CDffiiScD 
^IST-fet), I max , Imin ttmMllfeOffjSfitOS:*: 

[0 0 5 8] SfegiJ^^i: bT{i:A:iK [ 2 ] tstf J;^ 

- ■ wl' = {wl p(I I 01)} / {p(I I 00) (l-Wl)+p(I I 01)wl} - 
ti3it. Z<Dvrl' ^mm&ttVXmi^?>o ^oV^ar^^h (x, y, I) ipf, 

[ 0 0 5 9 ] 3 m^mmm<Dm t) m vm^<Dm^ 

p (I I 00) = {l/2h|x,y,I)} X hlx.y.I) 
I = Imin, -IinaxT-fe?). p (I I 0 1) tt-^tCT^ ♦ 

P (I 10 1) =1/ (Imax - Imin ) 



else (4) 

0, 0 1 0»Slj8i^As^ti^nw 0 , wlT-S5i:§ 
(ife;^ bw 0 +w 1 = 1) . *msi^w 1 ' tt»S0 k 
«D*i:-r-©iB^)Wfi[i®^mt§fii^;£p (i I 0k) 
ii -r s i: # '^'f x©^a4» e> 



IS) 



(6) 



(7) 



l=wl' iiHffU wl©«)SBfil=l. 0/2. Ot 

bt<tVA, 

[0 0 6 0] (d) l5S^ffiS©5sfe^ 



SI 



Ao^J: bTig^^^b©^i>*v^M^i: bT»®a 

[3] fc^1-^^5I^g|Srffll^•t^BbfetODHlbfS^i^ 
[00 6 1] (e) 
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^tol^T, m-^m^l2h (b) -es^LfcOhlll-O 
<iAi®*)©D i^&a^-rs (S 6 1) o ■?UT^ 

xm(ommmm.^^%\n-r^ (sea) o mk. 

(SEH) ^ittH-rs (S6 5) . 3Cfc*jV>Tt,, ttJt) 

(3) " st«^v>fcv>y^>^^£fTd. 

CO 0 6 3] M{3C10V'j/5^>^^^e®#i:Hi-^-r?. 
^■t•ClnS:^TV^ (S 6 7, S 7 1 ) , ftjS^^ -y^^V^/ 

2 (S6 9) o 

[0 0 6 4] M{^,^®^'V5^>^0(^CD{t^;KcD®l,'> 
tt. /SiB^i: 1 0 0%C5£V>v>y5^>^'§fTao -15. 
[0 0 6 5] M^Kfc||t-aLtV>fllSrlS 

[0 0 6 6] (2)^2 (D'^'ima^mm.'immm 

[ 0 0 6 7 ] ^ 1 ©^ffiff^^CDl^tts ^S5^-^(D» 
[0 0 6 8] ^2CDllW^^^C*V^T{i^ 



(3) 

n 

tk%mx^<r>. 0!l^«llm©^il^s<)«clnc*r^s■r5Jbs^ 

JOacfflM'fb * 0 S * CD t- S , 
[0 0 6 9] @6y:. ^2(D|^illg^l|gCD«|j5g@T*^S. 

^m^^mi> 1 8 cnngESB^ns^wtfigas i 

II 9i:. c:nK:g!iK$n5I^SfffFfiiiiSlJ2 Ofc. cititS 

i^^n2.gl5d5^^«2 3 i:> eitltg^i^^tlS^fa^J 

aHigi5 2 1 i:, c:ntSEigl$ni.M^^it^a5 2 2 i:, 
CtltgE«S^FnSBS^6 h, ^»f^fi£gS 1 9 

2 5, R^^fflW#^IS2 6 i:, 1 9 

«i^ti5^«^atBfflffSMifSE«2 7. m^mm^m 
mmmmmmz M^fflis««Ks8ES2 9 i:. 

6 OtCgf^^n 5*^^5 3 1 CltlCgEi^^tlS:!- 

•tfs#gi5 3 0 i:§^n^n*-r2>o 
»» [0 0 7 0] c:cDJ;a*«ji{c*5V>-c, ^2osisiyf^si 

[0 0 7 1] WT> ^2<Dmmmm<Dms.<Dmi'piL. ^ 

)feSiii^A;^g|5 1 6 1 J: !)i!c7 b-Acri;{c:A:t;$nfc 

7^>f i?^';i/H^7^-^ ^-s> m^mmm 1 7 cisift b 
(S7 1) . i^^^jsa^mi 8T-ff e.tifeif^.'^B 

om^^W t) W b, S^Sf'^fiK^m 1 9 <fc t mli^Oi^ 
S2 3, 2 7, 2 8, 2 9%f^figt-5o 
[0 0 7 2] S-r, Hi^AijSPl 6KJ:>}f»7U-Ar 

ti^xti^ntzy't^^ )vmm^xtiLmm.mm^ 1 7t 
tBSP 1 8 J; ») . mLtmmir<Dmmxmx'<Dmm^ 

d (S7 3) o ^bt, J?S#fi£a51 9fcj3l>Tli. 4^ 

S[J^oi£^lo'J^«^*^^:b•rt)■fe^ ■?ne>,^ia^i£:± 

m^if h)l^t^nm2 4tLXfF^-t?, (S7 5) o d 
" SI5^a«Jfi3?B^i!j-^fb©itei<4:{cJ;SS«^©-g|5©Rint 
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[0 0 7 3] HK*iirao^4;K'f >^^:u•c^ nmn 

(S7 7) o 

[00 7 4] »tt*^wo^ 5 5j«>f > h bt:±if 6>n 

v\ c:titj;b, nm7'-'^(oi$immmm^'i7o (s 

[0 0 7 5] Z.(Oi:.^i3M^^y:^)\'-k-^=-:J-T)Vnm 
i^c:tt:fl^-r.S (S8 1) o ^bTs ^»f^fiEg|519 

8 3) , ^bT, ^-i^^iS®^&^^^titM^t•:5V^■C^i^ 

[0 07 6] ^ b-^:^ 6 Tj^-r > h h br , ^ 

> 1- s ^ft^^ip e> b § I >ftSJ.:A± t ^-^ 
^ifc©^^* t) ©® tf!^- ^stR bt -7 y 5^ > ^ t^ffl -r s ^ 

i:-t> 5OJa3ift[qI±5fe0S*)<Z)t-ifeS (S8 5) c 
[0 0 7 7] A;^iii0{::MbTs ^!m\^m l 8 

Ji^g|52 2> )i3-&gi5 2 3C:}3v^■^:^ Mifimmf^oifzii 

[0 0 7 8] X, ^i^±^D^yf•>^^C:*iV^rx Stx * 

i^mmmm^m (27, 28, 29) tiBmb-c:}3<. 

[0 0 7 9] ^lCD||J(g}BSIt|gj;aK:> HlfeX 



(10) 

tt, Homage 2 8hits<D^^{/timmm^t 
i^xmrn bs * 7-iii^ic*f^i: Ltzm-^Kntxtjmm. 
(RGB) cDfefig^^ (1) sctbfei)So-c, mBm» 

[008 0] ifetia 8 14^ mm^(Dmm:nr>n<Dm^ 
ifei5iWbfc7D-f-^— b-efeD, s-fA^SPi eccfc 

igb (S9 1) . ^i^fflffigU 1 8t*DVAT> A>3 

10 bW-r (S9 3) , Mt, 0^^UigS{c*3V>-CH:«Jt) 
btl> M^^i^if^^nS (S9 5) o ^CDl^ttt^ 0 
2 7 C:Sr3<iWjg«ES#ffl bt:ii«^i)S}t^$nSo 

iitM^^jt^ffl^iS2 5 >in^wf^mmmm.mm 

i:|^S2 3fc®v^y^>^9"ASfft>tl, mS^Asjf^^n 
S (S9 7) o 

[0 0 8 1] ^f>tM^a5 6 0{C*5V>Ttt, 
9 9) , 

[0 0 8 2] ±Mbfc^CDdt. r^^j^CDtt- 

[0 0 8 3] (a) A*il04»t,CD!t^j^}fttti 

TtpiffltgiB^-rso i^^samsBi 8i±*^b^cd^3 
[0 0 8 4] xtimmtmmmm(D-yy^y'^. *5ii5 

a lott, M^f!i{c:ffl^t--5^iiim<feii©l5®S*f^i:b 
fe^^ti < J; t) v y ^>iJ^- btSggl-^fflll 

[0 0 8 5] ciz:-e{i*®<fca«:7^i^;^^i'#S**i!Si 
im^^tbiiitbiz [4] 

li0-^KilfflA;tJli0±Stov>T. [4] ojM^ff 
§^fflm;S{c:o^^■co2x2^f?U®SWfi<&«to 

5« i:bT^3io:)^^T-3-:^-^ifeambs 
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[0 0 8 6] felTCD«iaoXg*S9{c:»-:5^ift?a-r 

[0 0 8 7] laSiA^tjSIJl 6C:«tt)liffe*A*L 

fcf) (S 1 0 1) , fee.*^bK)JliS{::s obe 1 [5] 

2 xS:*;(ii^fii 
P^-2xg:*:il5^fi| 

- ^ p < g:#clilgfii 

"X" ^^iRjiScDS^tt 

[0 0 8 9] g^t =!-:»- i:!|uj^$tlfcai^P (x, 



WJ^«f 2fl{b [63 S:fT-5 (S 1 0 3) o 

LT. c:©Bfl§?^pf'(Z)#iimt3ov>t:, aiiii^3&«i®® 

■&{C01O (a) (DVj^i^t.tSconvo lut i o 
n^hSo ^tTlgm^iii^fitpoDfStio-C, 

[0 0 8 8] 

X" mmLi^ 

jiss^sbs (s 1 0 5) . -r*t>% "+" m.a> 

★*J^-rs (S 1 0 7) , STfilTOGradient 
matrix G^^ii>Se 
[0 0 9 0] 



G=Z (gx* gx gx< 

W (g y* gx gy^ 



gy) = (a 

gy) (c 



b) 

d) 



(8) 



A>/Lt 

gosgfittt. SQalR^^*)gl5ffl§ilm^it{;•ol^•rWJ^9•^f 
k: ^ o -c««i) o ^Mffl -r 5 o 
[0 0 9 1] (b) ^S©^m2:Elg 

(2) (a) (n-fj'mw^-^tx.fzw^mMwm^^aymm 
wxHli^©^i^l^^^i5®gp^iB^fe«r^aI^)mu 

ol^-^:©^g|5:S'^ij^'^^ h;u©it»i;S:*«)t, 

[0 0 9 2] b*»U ■y■>:r;^iSi)S#v^jl-^ttt±BB^ 

Dij-l/j -2 i Pi -Pj I 

j = 1 , —K " 
fc^b, Pj : lij^JF^©a5:9-II^©dt.> Il^Irt© 

[0 0 9 4] W±*±-tf>7';utov%Tfh#bfciSS© 
7"— :r;i'©fi»I^EIl iC^-To c:fi^)©SEil (^(^©D 
l2~DnD-0 ©MS^^^&^SKb. «a!^^©:9-fclt<& 
«;*ct-rsiaK:if3lsij^W^ [6] *jifflbTWfiS&* 
toS. c©i:tKfiiJ;>3iig||6Sj5A»ofc-t)->:r;Hi^rt 



(9) 

r> *^^©m4:4?'f >hi:bT©:5ST-{±. 
r;!/!!^ I rtgC©#gB^iil^Wi ( i = 1 , -N I ) © 

I) rt©gi5^ii^©a*.> ^ommtTi tcommifim. 

'J^^5:5^)©wj i:^©ggltPj **46i). cmcio 
•C^ wj ©S■t■■5■l^>7';^iffl0A»*^&f,nsfe^^), 
I f*i©g|5^iffliiwi fcTi'^-rsf&-b->7';niii©^ 

;Hi^Jiiiax ASiB^Ii:V'y5^bfet:^«f -So ifcZ 
©i:t©Ii:Jmax h©lig8IDij;&fiiTT-^ll^5. ■ 
[0 0 9 3] 

- (10) -- --- 

[0 0 9 5] *fe*^W©m5;}«'r>bi:bTo;?&-c 

fc^tjiffl-rst)©i:-rso ^-r. ^»t^ffl©^ii^ 

W-^:&lRlo^i&©)iMStv-h^5. dOctat-rsi: 

-fist i:«t t) d -y-^r^i/fflifiifflrtcaj^ii^oij^iHi 
[0 0 9 6] ^iz±.m<DV->y)m§k<D±&^m^iz-D 

v^•r^ SSi5^?'Iii«P5-e®iti&K8l*:^«)54s, tb^ 

Sias^il^wi *^t>«^^^fcfc^{cfte©g|J^il^wj AsgM 



1*Ba2000-99722(P2000-99722A) 



(12) 



X. mmt^'^y:^)mm\^o^^m^vri\, wi2, 

tit V> a ^Sf^i- 1 y ;^ 7*7 -Y if V*-?: 
[0 0 9 7] ^ e, Kl^Sft^as 1 9 T-tt. *%H^(!D^ 2 

2Pi X Vi 
i = l, k(cj;t)ffli^z:i:cDM««Aiii^&#fiE-r5o 

[0 0 99] g^t. =&-9->7';nB0 1 -k^mrnHm^ 

Cwi=S/Piax 

■SUt" So 



[Old] mmmm^<Dm'^ ^v-yyjm^omi 
mmim ■. m± 

Cwi<CT 

[0 10 2] (c) -^^i/^y^:^^ 

[ 0 1 0 3 ] xig 1 : A;tHiilil^*> e,a5^1iilfeS}atB 

^) . 

[0 1 0 4] xa3 : m^ST= (ft:*;m:i!®SH/A 
Pi (k) =Z I (j) X 

j = 1 , M 

tzHL. Mttse^ii^-iij^ ^;KD^>:^IS^ c (i) aas 
^mmi o^m^-mwL. dk (j) \mm^^<om^® 

[0 10 8] ^t)l;*^W©^7 4^'f >bi:UTOjlfS^ 



[0 0 9 8] ^•rffl^c:i:t^;5x.fc^i&-y->7';i'{;^ 
^^]!)S■9■>r;nii^rt•r•#^^^<feg^^■rfci&> gii^^cDfe 

S^fiSPi . mM^^Y)]/Vi (i = l-n) ^JfiiiSb 
(11) 

[0100] 

(12) 

^ e. ^c^■^©(g^^gp^^iii[|^#-r•®■T >^ 5^ y y t J; ssa 

(5ifflbfejtnHAv>o z.(Dtziiiiz. mx.\s.. ±5fiogp^ 
mmisxm [ 6 ] t J; & eif( c T ^aj^n^sE 

(13) 

i!ri'7 )-»©^iK«;*c55:5"i7'f >H">&SV^ne)i:^» 
[0 10 5] ±!B(::*i'>t> xgi, jM2, xgao 

iaa^M^gp 2 3 Asfri^. s fc. 1 ~xa 4 

^Ji^g|5 2 2*sf?5o g^tM«^fll^gI5 2 l{c:^5V^ 

x@i'-xig4K:i^i%t> ■y->7';i/iffl^if®Mia 

[0106] tan. ±mx(o I ^jt^fiti: b, mmnm 

±■^xgl{c^3v^-ctt^ ±@5-e3£'^feffli^zrji©^-^^ 
WiC^mw (3) " i:lH)J:ata5^il^li ©gfS2^ 
K'N©M^Pi (k) S:«i6Si:W^tm&%ltb)fcv 

[0 10 7] 



C (i) X dk (j) (14) 

^(ommnt^ (d (c) hmi^oiz^i^. ^mmn^xti 

m^t^^. ±l3i:l5lj:aK:gB^lj^*«J>)tBbs ^©^ 
W.\^<D$tmm (x, y) {Cji^bT (4) S;©T (x, 
y) (5) S©wl' ©iatc^ii, -fn&T (i) 

Pi (k) =Z I (j) xC (i) xdk (j) xT (i) (15) 
j = 1 , M©J;dfc:f -SCiits ffli^rii©giS^-^K " ^a^^ fi?S©{i^?£%itbfcDy^>^ h!&SII6A«prili: 
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[0 10 9] tt±v *^woA!ag;sii^atin«> 

iiSV^H^s Sfett-fcDiliacDli^o*i*ig|S{{c:*Jffl1-S 
[0 110] Sfc, MS^»i:Li:^cDfe5«^±#;oii- 

n e> o V 5=- > <5^i^«*«&-&!i!jj^-r s ^ t cfc o -c 
[0 111] dcDi^, mm^nm)^^~y(D^-9'^ 

7';i/iii^rfc£D5t:«tt*Mthfl<it?miiiiLTJT:**:-'^^J'- 

[0 112] g^jt^. M^^lv^3^^^l 

aCfSbt, Sit«:gP^>'t^->®^-^^&%/tbTffiffl 
igS5ttfB!&«b < |Bl±-r 5 CI tjbSqfllt^S. 

[0 113] #%:S:Sft*WT{;?^■ro 

[1]K. Sung and T. Poggio. , Example-based Learning for 

View-based Buian FaceDetection. In Proceedings fr 
om Image Understanding Workshop, pp. 843-8S0, Novembe 
r 1 994. 

[2] fftt : »^^i^^£ffl^^fcWtl^MfetB#^,- tt» 
m. 94-CV-90,pp. 1-8, 1994 

[3] lup. m#. ma. mmm^mi^^tzmm^i^T.T- 

A. iS^i^m. PRMU97-50.PP17-23, (1997) 
[4] Shi, Toiasi, Good Features to Track, IEEE Conferen 
ce on Computer Visionand Pattern Recognition (CVPR9 
4) Seattle, June 1994 

[5] I^tK. Tffl> "E^MSf^NyKT-y^" , 
tBfig^x pp. 553-554. 1991 

[6] WJ*5J;Vg/h-sllS^K:S-:3<giib*l^ 

mm^^. m^m. voi,j63-d, no.4,pp. 349-356, i98o 

[6] Si*, TBB : m§m9r^^> Y:rvif PP. 578 



(13) 

[7] ^'fi'-, "■^mm'm.^LWM'&^m^y^x. 

^Z^TKM^oy^ :f^—*J7.ii^V7-^yir^ ff®il. Vol. 
36 No. 10,pp2234-2243,Oct. 1995 
[0 114] 

[l8B.§©aS] i¥iiltgiiB^bfei:*)t), imm^ 

b-c»stiit-5m^a[**«), 

b-r43<Ci;TN ^^acDl^t, 0'J;i«^»^-^'As 

t)> «iSI«:iSi8^ll3ti-r.5j<yi'i{c||b-Cttv«j»^> 
[0 115] XMtC, i^»5^-^®#(i^ias (^-3;$/ 

[IS9(DlS#>5:ii^] 

[la 1 ] Jtc^^iS©^ 1 ^ff$Sik:^l,±^t:«^Elo 
[0 21 *^H^0^1^Mff^^t^^l!liiiS5!iac67D- 
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